Performance evaluation and comparison of ITU-T/ETSI voice activity detectors
نویسندگان
چکیده
The paper proposes a performance evaluation and comparison of recent ITU-T and ETSI voice activity detection algorithms. The comparison was made using both objective and psychoacoustic parameters, so as to have reliable judgements that were close to subjective ones. A highly varied speech database was also set up to evaluate the extent to which VADs depend on language, the signal to noise ratio, or the power level.
منابع مشابه
Improving Voice Activity Detection Used in ITU-T G.729.B
In this paper, by using a new envelope estimation algorithm and a geometrical adaptive threshold method; we present a novel method to improve the performance of ITU-T G.729.B systems in various noisy environments. The proposed system has minimum change from this standard. We compare the performance of the proposed method with G.729B, ETSI AMR option 1 and 2 using objective measures. Key-Words: ...
متن کاملVoice activity detection algorithms using subband power distance feature for noisy environments
In this paper, we propose two robust voice activity detection (VAD) methods for adverse environments. A single subband power distance (SPD) feature is estimated from different wavelet subbands and further improved to be robust against noise. The first method is based on a neural network that operates on an input vector which consists of the SPD feature and its first and second derivatives. The ...
متن کاملStatistical Tests for Voice Activity Detection
A robust and effective voice activity detection (VAD) algorithm is proposed for improving speech recognition performance in noisy environments. The approach is based on filtering the input channel to avoid high energy noisy components and then the determination of the speech/non-speech bispectra by means of third order autocumulants. This algorithm differs from many others in the way the decisi...
متن کاملIndependent Component Analysis Applied to Voice Activity Detection
In this paper we present the first application of Independent Component Analysis (ICA) to Voice Activity Detection (VAD). The accuracy of a multiple observation-likelihood ratio test (MO-LRT) VAD is improved by transforming the set of observations to a new set of independent components. Clear improvements in speech/non-speech discrimination accuracy for low false alarm rate demonstrate the effe...
متن کاملBispectra Analysis-Based VAD for Robust Speech Recognition
A robust and effective voice activity detection (VAD) algorithm is proposed for improving speech recognition performance in noisy environments. The approach is based on filtering the input channel to avoid high energy noisy components and then the determination of the speech/non-speech bispectra by means of third order autocumulants. This algorithm differs from many others in the way the decisi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2001